On the identification of relevant degradation indicators in super wideband listening quality assessment models

Authors

  • Sibiri Tiémounou
  • Régine Le Bouquin-Jeannès
  • Vincent Barriac
Abstract

New objective speech quality evaluation methods have recently been developed for high voice quality contexts. One advantage of these methods is that they integrate perceptual dimensions of voice quality that reflect, respectively, the effects of frequency-response distortions, discontinuities, noise and speech level deviations. As a result, they can also provide diagnostic information about specific aspects of a transmission system's quality as perceived by end users. In this paper, we present and analyze in depth two of these approaches, POLQA (Perceptual Objective Listening Quality Assessment) and DIAL (Diagnostic Instrumental Assessment of Listening quality), in terms of the quality degradation indicators related to the perceptual dimensions these models may embed. The main goal of our work is to identify and propose the most robust quality degradation indicators, both to reliably characterize the impact of degradations along the perceptual dimensions described above and to identify the underlying technical causes in super wideband telephone communications (50 Hz to 14 000 Hz). The first step of our study was to establish, in both models, the correspondence between perceptual dimensions and quality degradation indicators. Such indicators are either present in the model itself or derived from our own investigation of the model. In a second step, we analyzed the performance and robustness of the identified indicators on speech samples impaired by only one degradation (representative of one perceptual dimension) at a time. This study highlights the reliability of some of the quality degradation indicators embedded in the two models and constitutes a first step in evaluating how well these indicators quantify the degradation for which they were designed.
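The second step described in the abstract (checking an indicator against samples impaired by a single, controlled degradation) can be illustrated with a minimal sketch. This is not the authors' procedure: the choice of Pearson correlation plus a monotonicity check is an assumption made for illustration, the helper names `pearson` and `is_monotonic` are hypothetical, and the numbers are invented placeholder data, not results from POLQA or DIAL.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation undefined for constant input")
    return cov / sqrt(vx * vy)


def is_monotonic(values):
    """True if the values never decrease, or never increase, along the sequence."""
    pairs = list(zip(values, values[1:]))
    return all(a <= b for a, b in pairs) or all(a >= b for a, b in pairs)


# Hypothetical placeholder data: a controlled degradation level applied to
# the speech samples, and the value some indicator reports for each sample.
degradation_levels = [0.0, 5.0, 10.0, 15.0, 20.0]
indicator_values = [0.1, 0.9, 2.1, 2.9, 4.2]

r = pearson(degradation_levels, indicator_values)
print(f"Pearson r = {r:.3f}, monotonic = {is_monotonic(indicator_values)}")
```

Under these assumptions, an indicator that tracks its target degradation should give a correlation close to 1 in magnitude and a monotonic response; a real evaluation would also test the indicator on degradations it was not designed for.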

Similar resources

Diagnostic Instrumental Speech Quality Assessment in a Super-Wideband Context

Speech quality models usually estimate the integral quality of the degraded speech files. Such quality values do not inform system developers and telephone service providers on the perceived degradation introduced by the system under study. This paper describes a new intrusive speech quality model, called Diagnostic Instrumental Assessment of Listening quality (DIAL), providing diagnostic infor...

An intrusive super-wideband speech quality model: DIAL

The intrusive speech quality model standardized by the ITU–T shows some limits in its quality predictions, especially in a wideband transmission context. They are mainly caused by strong differences in perceived quality when speech is transmitted over different telephone networks. Instrumental methods should provide reliable estimations of the integral speech quality over the entire perceptual ...

Validating Perceptual Objective Listening Quality Assessment Methods on the Tonal Language Igbo

In recent years a great deal of effort has been expended to develop methods that determine speech quality through the use of comparative algorithms. These methods are designed to calculate an index value of quality that correlates to a mean opinion score given by human subjects in evaluation sessions. In this paper, we validate Perceptual Evaluation of Speech Quality (PESQ) ITU-T Recommendation...

A Critique of the Iranian Model of Desertification Potential Assessment (IMDPA)

Drylands occupy a large share of the Earth's land area, and a large percentage of the population lives in these areas. Land degradation, or desertification, is one of the biggest problems in arid zones. In general, little effort has been made to map land degradation at regional to global scales. Recent efforts to assess desertification in Iran led to the development of the Iranian Model of Desertification...

Super-Wideband Bandwidth Extension for Wideband Audio Codecs Using Switched Spectral Replication and Pitch Synthesis

This paper describes a new bandwidth extension algorithm which is targeted at high quality audio communication over IP networks. The algorithm is part of the Huawei/ETRI candidate for the ITU-T super-wideband (SWB) extensions of Rec. G.729.1 and G.718. In the SWB candidate codec, the 7-14 kHz frequency band of speech and audio signals is represented in terms of temporal and spectral envelopes. ...

Journal:
  • Speech Communication

Volume 55

Publication date: 2013